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ABSTRACT 

A  procedure  is  presented  in  this  paper  for  treating 
the  nongeolocational  aspects  of  multi-target  correl¬ 
ation  and  tracking  based  on  the  well  developed  cal¬ 
culi  of  possibility  theory  and  certain  new  theoret¬ 
ical  considerations.  More  generally,  it  is  shown 
that  a  combination  of  possibili6tic  and  probabilis¬ 
tic  techniques  leads  to  both  a  feasible  and  optimal 
parameter  estimation  algorithm.  The  technique 
incorporates  observed  data,  error  distributions -or 
equivalently,  matching  tables  for  attribute  outcome 
values -and  inference  rules  connecting  attribute 
matching  intensities  with  consequent  correlation 
levels 


1.-  INTRODUCTION 

The  multi-target  multi-sensor  target  data  associa¬ 
tion  or  "correlation"  problem  remains  as  one  of  the 
chief  obstacles  in  constructing  a  comprehensive 
theory  of  surveillance.  A  survey  of  the  state -of  - 
the  -art  (unclassified)  may  be  found  in  the  Naval 
Ocean  Surveillance  Correlation  Handbook  (two  ed¬ 
itions)  ri,2l .-  A  good  number  of  previous  and  present 
approaches  to  the  correlation  problem  are  based 
upon  classical  Bayesian  statistical  techniques ..  Co 
the  other  hand,  some  approaches  to  the  problem  do 
rely  upon  heuristic  procedures  or  mixtures  of  heur¬ 
istic  and  statistical  principles.,  (See  again  11,2] 
for  descriptions  of  these  systems.-  In  addition,  see 
[3,1*]  for  an  excellent  example  of  the  total  Bayesian 
approach . ) 

In-  any  case,  it  appears  that  a  large  percentage  of 
correlation  problems  involve  vague  or  linguistic 
information  which  is  not  easy  to  model  from  a  pure 
statistical  viewpoint.. 

As  an  example  of  the  above  statements,  consider  the 
following  four  attributes  which  are  commonly  involv 
ed  in  informational  Inputs  relative  to  tracking:,  A^=* 
class,  A 2=  frequency  of  signal  at  its  source,  Aj  = 
ship  mode,  and  A^  =  geolocation  with  •  oufidence^ 
ellipse.;  The  natural  domains  of  vclues  of 

these  attributes  are  typically:  dom^A^)  =]C^,..,C ^ , 
each  a  latel  for  a  category  of  ship  ;  dom(A^): 
Interval  (0,M  ]  ,  where  M  is  some  suitably  chosen 
upper  bound  (?n  hz.);  dom?A_)  «  fD^,..,Dn}  ,  each 
D,  being  a  label  for  a  mode^of  operation,  noting  the 
highly  overlapping  flavor  in  general  possessed  bv  the 


C,  1 s  and  D  's,  where  6ome  could  actually  represent 
s(h>categories  with  respect  to  others;'  dom(A, )= 
l(p>Ep)  |  p  any  point  in  R  ,  E  any  confidence  el¬ 
lipse  centered  at  p  ;  each  E  ^has  the  same  fixed 
probability  level}  Next,  ?et  i  and  J  represent 
two  fixed  track  histories.  That  is, each  letter  rep¬ 
resents  a  collection  cf  data  from  possibly  several 
different  sensor  and  intelligence  sources  which 
is  assumed  to  correspond  r.o  the  same  (usually  un¬ 
known)  target  source.-  This  data  may  be  oiacsn  icd 
into  the  four  types  of  attributes  mentioned  anove ., 

In  addition,  it  is  assumed  that  error  distributions 
-  or  equivalently,  matching  level  tables-  are  ob¬ 
tainable  for  each  of  the  types  of  observed  data. 
Finally,  it  is  assumed  that  prior  known  relations 
are  available  connecting  the  intensities  of  matches 
between  any  passible  outcomes  of  attribute  categor¬ 
ized  data  between  i  and  J  and  consequential  levels 
of  correlation  between  i  and  j.  Usually,  the  latter 
is  in  the  form  of  inference  rules.  Both  matching 
tables  and  inference  rules  may  be  obtained  either 
analytically  ,  using  physics  and  geometrical  con¬ 
straints,  or  empirically,  through  the  establishment 
of  a  panel  of  experts.  The  term  "distribution"  as 
used  above  may  refer  to  classical  probabilistic  or 
possibllistic/fuzzy  set  definitions.-  (SeelJJfor  a 
survey  and  summary  of  possibilisi ic  distributions 
and  properties.)  Thin, some  statistic  (in  the  gener¬ 
al  sense)  is  sought,  which  will  estimate  the  unknown 
correlation  level  between  i  and  j  ,  based  upon  the 
available  data,  catching  tables,  and  inference  rules. 

The  procedure  presented  in  this  paper  is  based  upon 
three  general  theoretical  types  of  results,  obtained 
previously  by  the  author: 

(a)  Fuzzy  sets  and  their  operators  correspond 
in  a  natural  way  to  random  sets  and  their 
operators  such  that  fuzzy  set/possibili stic 
modeling  in  effect  is  a  weakened  form  of 
probabilistic  modeling,  thus  allowing  for 
interchange  between  the  uwo  types  of  model¬ 
ing.  This  result  leads  to  the  procedure 
where  all  input  information  to  the  correla¬ 
tion  problem  is  converted  seperately  to 
Dossibillstic  forms  connected  by  ordinary 
^two-valued)  logical  relations-usually, 
conjunction.  Then,  following  the  applica¬ 
tion  of  the  algorithm  (described  below)  , 
the  initial  outputs  in  possibilistic  form 
are  reconverted  to  probabilistic  form,  if 
desired.  (See  L 6 1  and  (71, ) 
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(b)  Given  input  information  consisting  of  an 
ordinary  logical  combination  of  possibil- 
istic  descriptions  of  an  unknown  parameter 
vector,  a  uniformly  most  accurate  pure 
possjbiliutic  description  exists  which  is 
obtainable  by  replacement  of  all  ordinary 
connectors  by  corresponding  (appropriate) 
fuzzy  6et  ones.  This  description  can  be 
shown,  under  sufficient  conditions,  to 
yield  an  asymptotically  consistent  estima¬ 
tor  of  the  parameter  in  question,  with 
computable  error  bounds  involving  the  or¬ 
iginal  description  and  the  pure  possibil- 
istic  one.  This  result  forms  the  basis  of 
the  structure  of  PACT.  (See(83;C9!l ,  sect.Xl) 

(c)  Under  very  general  conditions,  conditional 
fuzzy  sets  may  be  constucted,  analagoos 

to  conditional  random  variables.  In  turn, 
this  leads  to  a  possibilistic  version  of 
Bayes  theorem.  (See(9J>  C 10 3  .)  then, 
with  the  identification  of  inference  rules 
with  posterior  parameter  distributions 
(and  matching  table  forms  with  posterior 
data  distributions),  it  can  be  shown  that 
the  output  description  of  the  correlation 
(the  unknown  parameter  here)  as  estab¬ 
lished  in  result  (b)  is  essentially  the 
same  as  the  posterior  distribution  of  the 
correlation  in  the  possibilistic  Bayes¬ 
ian  sense.;  (See  til]  .) 


2.  GENERAL  DIAGNOSTIC  SYSTEMS 

Before  exhibiting  the  structure  iw  of  the  PACT  algo¬ 
rithm,  we  will  consider  a  more  general  diagnostic 
system  which  encompasses  not  only  the  pScV  algo¬ 
rithm,  but  other  applications,  including  classifi¬ 
cation,  system  diagnosis,  and  medical  daignosis 
techniques . 


Let  attributes 
formation  over  w 
gorized.  Thus  we 


A  ,  Ap,...,  A  be  m  types  of  in- 
hich  observed  9ata  Z  can  be  cate- 
vnte  in  partitioned  form 


A 


L, 


(U 


where  \  is  observed  from  the  domain  of  A.  , 
dom(Ak),  for  k=l,...,m.;  is  assumed  dom(A  )  is 
known.  Corresponding  Zk  we  denote  as  a  variable 
Z^any  possible  value  Zk could  have  taken  in  dom(Ak); 
similarly  for  Z. 


Let  Q  denote  the  unknown  parameter  vector  of  inter¬ 
est.,  Denote  the  matching  table  (or  by  a  simple 
transform,  the  error  distribution)  for  attribute  A,, 
by  M,  .  Typically,  M  is  evaluated  as  a  number 
hftvppn  0  and  1  ^ 

0  -  £  1  • 

Define  symbolically  R  to  correspond  to  the  tth 
fuzzy  relation  connecting  any  Z  with  Q.  Specific¬ 
ally, 


B  •:  dom(Ak  )  X  0o.a(9)  — v[0,l],  (3) 

L  v=l  v 

where  1  4  <  k^  < . <1^  <m  represents  the  col¬ 

lection  of  attributes  involved  in  the  tth  relation 
Rt  Typically,  is  evaluated  (clearly,  as  a  mem¬ 
bership  function)  as  a  number  between  0  and  1 

0  S  Rt(Z,Q)<  1  ,  U) 

with  some  abuse  of  subscript  notation!  Note  that 
formally  M.  and  R  are  possibility  distributions 
(or  equivalently fuzzy  set  membership  functions) 

We  may  think  et  corresponding  to  the  following 
linguistic  description 

=  possibility  that  is  the  ,rue 

^  value,  when  data  ^  is  observed, 

noting  both  Zk  and  t  dom(A^). 

Similarly,  we  may  interpret 

R  ( Z , Q )  =  possibility  that  Z  (through  at- 

1  tributes  A.  , .  .  ,AV  )  and  Q  are 

related.  1  U  (6) 


Theorem  1..  uniformly  Most  Accurate  Estimators 


Suppose  that  information  concerning  unknown  param¬ 
eter  Q  consists  of  the  following  forms: 

( 1 )  Dctta  ?  . 

(ii)  Matching  tables  M.  ,  k=l,..,m. 

(in)  Relations  R,  ,  t  =l,..,r. 

Let  g:  LO ,  1 J X  •-)(10,1J - >[0,l]  be  ncndecreasing 

( m+r  factors) 

with  respect  to  the  partial  ordering  of  vectors..  In 
particular,  g  can  be  any  t-norm  ,  the  natural  oper¬ 
ator  corresponding  to  conjunction  ("and"',  (see  173). 


Define  the  possibility  distribution  9  by 

^(Ql  Z)  -  l-g(l-C(Z,Z,Q))  ,  (7) 

(all  Z  ) 

where  it  is  assumed  g  is  extendable  to  an  arbitrary 
number  of  arguments  (  this  is  guaranteed  if,  e.g., 
g  is  symmetric  and  associative,  which  will  be  the 
case  if  g  is  a  t-rorm  )  ,  and 

C(Z,?,Q)  =  g(  R(Z,Q),  M(2,Z)5  ,  (8) 

M(Z,Z)  -  g(Mk(Zk,Zk)) 

(k=l, . .,m) 


=  matching  table  effect  under  g  , 

(9) 


D(Z,Q)  -  g(Rt(Z,Q)) 

(t=l,..,r) 

=  relation  effect  under  g  ,  (10) 


and  Q  is  arbitrary  g  dom(Q)'.. 


For  any  confidence  levels 
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A 

=  ( d0, 

> 

(11) 

~  v  @i»  $2’"’ 

0r>  ’ 

(12) 

with  d. 

,Qt  c  Eo,U  , 

all  k,t  ,  define 

the 

oriEL- 

Inal  hypothesis  as 

H0(  *  >  6  ;8) 

^)jtMRt(Z,Q)> 

)  tc 

I  (Mfcd^Z  )>oL  )}.(13) 

k=l  K  K 

Then  (for  Z  fixed),  for  any  possibility  distribu¬ 
tion  D(£?)l  i)  as  a  function  of  Q  over  dornCQ'  ,  Z  over 
dom(Z),  yields  the  smallest  set 

«(£>£>*  ?  H0(o(  ,g;g)f  (1M 

simultaneously  for  all  possible  U  and  g  ,  when 
D  is  chosen 

D(^j|S)  =  C(Z,2,Q)  ,  (15) 

for  all  Z,£,Q  .  In  turn,  <$>  enjoys  a  similar  prop¬ 
erty  with  respect,  to  the  projection  l-g(l-*) 
applied  to  Hq  and  D. 

(For  proofs,  see  [9]  ,  section  10.) 


Thus,  the  above  theorem  exhibits  in  a  general 
setting  the  uniformly  most  accurate  single  fuzzy 
set  description  of  Q,  gi''en  Z  and  g.  The  next 
theorem  justifies  the  result  outlined  in  (c) 
above 

Theorem  2,  Posterior  Form  for  Optimal  Estimator 


For  Justification  for  the  recults  in  (a),  see  (11], 
the  sections  on  background  of  fuzzy  set  systems 
and  connections  between  fuzzy  set  systems  and 
random  set  systems.- 


3.  APPLICATIONS  TO  SPECIFIC  SYSTEMS 

I. .  Medical  Diagnosis  /  Cl assif lcatlon 

In  this  situation,  attributes  =  symptoms  ,  the 
fuzzy  relations  are  obtained  from  empirical  evi¬ 
dence,  while  the  matching  tables  are  obtained 
from  either  analytic  or  empirical  means.  The  fuzzy 
relations  refer  to  any  given  patient's  symptoms 
and  their  relations  to  his  possible  disease,  the 
unknown  parameter  here.  Thus,  dom(Q)  here  consists 
of  the  possible  diseases  of  relevancy,  with  the 
possible  addition  of  a  value  "not  known  yet-keep 
testing".  See  for  example  Q2)for  on  Artificial 
Intelligence  approach  to  medical  diagnosis  and 
[13]  for  a  similar  (but  not  justified  through  use 
of  results  similar  to  Theorems  1  or  2)  fuzzy  set 
approach.  Classification  differs  formally  little 
from  the  above  application.  (However,  this  dif¬ 
fers  considerably  from  other  fuzzy  set  approaches 
tc  classification  su-h  as  Bezdek's  (.141.) 

II.  Correlation  Problem 

Consider  firBt  a  set  of  co.ifusable  track  histories 
i  1.2,..,q)  ,say  .  Pick  out  any  i  t  j  ,  and  defin^. 
omitting  the^ obvious  subscript  dependency, 

Q  =  poss(  i  and  j  correlate,  i.e.,  belong 
to  the  same  target  source), 

Let  all  of  the  fuzzy  relations  here  be  ■'f  the  form 
of  Inference  rules.:  Thus,  linguistically,  a  typical 
Rt  corresponds  to  the  phrase 


Suppose  the  same  condition?  holds  as  in  Theorem  1 . 
Then  (J)  as  given  in  aq.(7)  is  the  posterior 
possiblli' tic  distribution  of  Q  given  7.  (  see[in) 

where  the  following  identifications  are  made: 

M(z,Z)  =  poss(Z|  Z),  (16) 

R(Z,Q)  =  poss(Q  |  Z)  ,  (17) 

and  the  sufficiency  condition 

poss(  (Q  |  Z)  j  (Z  I  Z))  =  poss(Q  |  Z)  ,  (l8) 

holds  for  all  Z,Z,Q  ,  and ’poss'’ refers  to  any  possi¬ 
bility  function  (conditional  form)  constructed  in 
accordance  with  its  corresponding  variables,  using 
possibilistic  Bayes  Theorem  (1113).- 

( Proof:  Simply  use  the  relations 

p°ss(  (2)1  ^ 

=  el  poss(  (Q  I  1}  I  (Z !?))  ,  poss(Z  I  2))  (19) 

and  then  apply  the  projection  operator  to  both 
sid^s  with  respect  to  variaole  Z.) 


"if  a  match  between  i  and  j  occurs  relative  to 
attribute  A  to  intensity  levels.  and,..., 
and  kl  1 

a  match  between  i  and  J  occurs  relative  to 
attribute  A,  to  intensity  level  o(,  , 


then  i  and  J  correlate  to  intensity  f(sCt)", 
vhere  f(s£t)  is  a  number  between  0  and  1  and  9^  is 
ihe  vector  of  ^’s  ;in  general,  both  of  these 
values  are  obtained  from  a  panel  of  experts. 

The  intensities  of  the  attribute  matches  is  most 
easily  translated  by  an  exponentiation  process 
applied  to  the  appropriate  attribute  catching 
functions.  A  simple  conversion  table  between  the 
degree  of  matching , expressed  linguistically  or 
initially  numerically  on  a  scale  from  0  (no  match) 
to  0.5  (normal  match)  up  to  1.0  (complete  match), 
might  be  established  by  use  of  the  relation 

((*))  =  x/(l-x)  >  (21) 

for  all  x  t  fO,l]  ,  where  ((x))  is  to  be  used  as  an 
exponent.  Other  translations  of  the  intensities  of 
matches  are  of  course  possible  and  may  be  more 
appropriate,  following  empirical  studies.-  (Future 
work  will  corolder  this  problem..  See  also  Dubois 


ibi 
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and  Frade  C 5 1  ,  PP-  '^6 -26k  for  similar  problems.) 

Combining  all  of  the  above  remarks,  a  reasonable 
possibilistic  model  for  inference  rule  t  is 

Rt(Z,Q)  =  ))),  (22a) 

C  (Z)  A  g  (  \  (i),  Z.  (>»)  ^(e<>iv)^,(C?b) 

(v=l,.-A)  v 

^»(3t,y)  =  l-g(x,l-y) 

*  0  (22c) 

In  this  case,  data  vector  Z  (  and  similarly  for 
variable  Z)  is  broken  up  into  the  i-data  and  J- 
data,  as  indicated  by  the  appropriate  superscript, 
with  the  previous  notation  still  holding  for  the 
attribute  indices.- 


A  summary  of  she  PACT  algorithm  is  given  below  in 
Fig.:  1  : 


Fig.  1  Outline  of  the  basic  correlation 
algorithm. 


1*. 


Based  on  three  general  results  ( ( a) ,( b) ,(  c) ),  an 
algorithm  has  been  developed  which  treats  the 
multiple  target  correlation  ^problem  ,  including 
data  categorized  as  nongeolocational..  Figure  1. 
succinctly  summarizes  the  structure  of  the  algo¬ 
rithm,  which  depends  functionally  on  the  collection 
of  relevant  inference  rules  chosen  as  well  as  the 
attribute  matching  tables.- 

A  number  of  problems  have  arisen  in  the  Implementa¬ 
tion  of  the  PACT  algorithm: 

(i)  How  should  attributes  be  chosen?  What  system¬ 
atic  procedures  are  available  for  determining  from 
available  experts  and  other  informational  sources 
what  are  the  me -it  important  and  distinct  attributes 
to  consider.-  Novakowska’s  clustering-like  approach 
[15]  or  alternatively  a  modified  factor  anaysis 
approach  might  lead  to  satisfactory  choices.; 

(ii)  In  utilizing  a  panel  of  experts,  the  way 
questions  are  formulated  is  critical.;  Consequently, 
use  of  questionnaire  and  psychometric  techniques 

to  extract  maximal  unbiased  information  is  necess¬ 
ary. 

(iii)  Perhaps  the  most  critical  problem  is  the 
actual  determination  of  the  inference  rules..  Even 
with  a  relatively  few  attributes  used  nF  a  basis, 
there  arejnyriad  combinations  of  possible  intensi¬ 
ties  of  attribute  matches  leading  to  the  corres¬ 
ponding  inference  rules.  Thus,  a  method  is  needed 
to  generate  inference  rules  which  are  relatively 
distinct  (too  many  redundant -like  ruler,  will  cause 
unnecessary  computer  running  time  without  adding 
much  information  content).;  Can  a  metric  be  designed 
which  determines  the  amount  of  "distinctness  between 
rules?  The  answer  to  these  problems  may  well  lie 
within  the  purview  of  Artificial  Intelligence  teeb  • 
nlques  or  related  search  theory  procedures  .; 

(iv)  Complete  flow  charts  for  the  PACT  algorithm 
in  its  general  form  have  been  made  (and  are  avail¬ 
able  to  interested  readers  upon  request).-  Prelimin¬ 
ary  numerical  runs  indicate  a  long  running  program. 
Consequently, by  utilizing  the  basic  bounding  prop¬ 
erty  of  t-norms  and  t-conorms  (see  ,  e,g.,(9]  , 
section  k)  ,  an  algorithm  may  be  obtained  which 

is  simpler  in  form  than  the  original  PACT  algorithm 
and  which  yields  as  outputs  lower  bounds  to  the 
posterior  correlation  distribution  : 


<t>(Ql  h  ?  lz)  , 

(23) 

1  2)  =  nmx(  0(2),  R”(Q)J  , 

i2k) 

0(2)  *  l-g(l-F(2,Z))  , 

(25) 

(all  Z) 

F(?,z),  -  gC  p.\(z),m(£,z5). 

(2b) 

R'(Z)  =  g(  1-G  ( Z) )  , 

(27) 

(t=l,..,r) 

R"(Q)  -  g(Q((f(s(t)  )))  . 

(28) 

(t=l,. . ,r) 
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